# Image Semantic Segmentation

Coco Panoptic Eomt Large 1280
MIT
This model treats the Vision Transformer (ViT) itself as an image segmentation model and explores its potential in image segmentation tasks.
Image Segmentation
tue-mps
119
0
Coco Panoptic Eomt Giant 1280
MIT
By rethinking the Vision Transformer (ViT) architecture, this model demonstrates the potential of ViT in image segmentation tasks.
Image Segmentation, PyTorch
tue-mps
90
0
Segformer B0 Scene Parse 150
Other
A lightweight image segmentation model based on the MiT-B0 architecture, optimized for scene parsing tasks.
Image Segmentation, Transformers
univers1123
20
0
Segformer B4 Crack Segmentation Dataset
Other
A crack segmentation model based on the SegFormer architecture, fine-tuned on a crack segmentation dataset to detect crack structures in images.
Image Segmentation, Transformers, English
varcoder
200
0
Segformer B0 Finetuned Segments Stamp Verification
Other
A semantic segmentation model based on nvidia/mit-b0 and fine-tuned on a stamp verification dataset, used for precise segmentation of stamp regions in images.
Image Segmentation, Transformers
bilal01
82
2
Segformer B0 Finetuned Segments Test
Other
An image segmentation model based on nvidia/mit-b0 and fine-tuned on the bilal01/stamp-verification-test dataset.
Image Segmentation, Transformers
bilal01
15
0
Beit Base Finetuned Ade 640 640
Apache-2.0
BEiT is a model based on the Vision Transformer (ViT) architecture. It is pre-trained on ImageNet-21k through self-supervised learning and fine-tuned on the ADE20K dataset for semantic segmentation.
Image Segmentation, Transformers
microsoft
1,645
11